User speech interfaces for interactive media guidance applications

ABSTRACT

A user speech interface for interactive media guidance applications, such as television program guides, guides for audio services, guides for video-on-demand (VOD) services, guides for personal video recorders (PVRs), or other suitable guidance applications is provided. Voice commands may be received from a user and guidance activities may be performed in response to the voice commands.

This application claims the benefit of U.S. provisional patentapplication No. 60/194,949, filed Mar. 31, 2000.

BACKGROUND OF THE INVENTION

This invention relates to interactive media guidance applications andmore particularly, to user speech interfaces for interactive mediaguidance applications.

Interactive media guidance applications, such as television programguides, guides for audio services, guides for video-on-demand (VOD)services, and guides for personal video recorders (PVRs), guide users intheir access of media offered through media distribution systems such ascable television systems, digital broadcasting services, and theInternet. Home recording devices, such as videocassette, recorders(VCRs), audiocassette recorders, digital versatile disc (DVD) recorders,compact disc (CD) recorders, and PVRs such as those provided byReplayTV™ and TiVo™, allow users to record media for later access. Thesedevices supplement the media that is available to users via distributionsystems. Such devices and distribution systems, whether used separatelyor used in combination, provide users with vast amounts of media fromwhich to choose. Accordingly, it would be desirable to increase the easewith which users can navigate through media or access guidanceapplication features.

It is therefore an object of the present invention to provide a userspeech interface for interactive media guidance applications that allowsusers to navigate through media or access guidance application features.

SUMMARY OF THE INVENTION

This and other objects of the present invention are accomplished byproviding a user interface with speech recognition. The user interfacewith speech recognition allows users to navigate within interactivemedia guidance applications, such as television program guides, guidesfor audio services, guides for video-on-demand (VOD) services, guidesfor personal video recorders (PVRs), or other suitable guidanceapplications. Such an interface may provide hands-free control over somefeatures of such media guidance applications. Various embodiments andfeatures of the present invention are described, for example, in U.S.provisional patent application No. 60/194,949, which is herebyincorporated by reference herein in its entirety.

In some embodiments a device such as a wired or wireless microphone maybe used to receive guidance voice commands from a user. When a singlemicrophone is used, it may be located in a communal place in a room.Speakers may be used to provide audible outputs.

Some embodiments may receive voice commands from a user and may performguidance activities in response to the voice commands. Such guidanceactivities may include, for example, source selection, browsing,obtaining media information, recording, playing back, or any othersuitable guidance activity. Guidance activities may be performed onmedia sources which may be provided by interactive media guidanceapplications. Such media sources may include, for example, televisionchannels, audio channels, video-on-demand content, recorded content(e.g., content recorded to a PVR, videocassette recorder (VCR), compactdisc (CD) recorder, digital versatile disc (DVD) recorder, or any othersuitable recording device), Internet-delivered content (e.g., websites,video, or any other suitable Internet-delivered content), or any othersuitable media source.

Some embodiments may select media sources in response to received voicecommands. Source selection may be performed from any suitable mediasource to select-content from any other suitable media source. Forexample, a user may be provided with content from a first media source.Upon receiving a voice command indicating content from a second mediasource, the interactive media guidance application may perform thesource selection and may provide the user with the content from thesecond media source. Thus, the content from the first media sourceprovided to the user may be replaced or augmented by content from thesecond media source in response to the received voice command. Inanother suitable approach, some feature of a second source may beprovided to a user from a first source. The first and second mediasources may be two different types of media sources. In another suitableapproach, the first and second media sources may be of the same type.

Some embodiments may perform browsing operations in response to receivedvoice commands. Browsing operations may be performed from any suitablemedia source to browse media available for any other suitable mediasource. For example, a user may be provided with content from a firstmedia source. Upon receiving a voice command indicating content from asecond media source for browsing, the interactive media guidance,application may provide the user with a browse display for the content.Thus, in response to the received voice command, a browse display may beprovided as an overlay on the content from the first media source. Inanother suitable approach, the overlay may be provided on a displaydevice separate from the display device that includes the content fromthe first source. In one suitable approach, the first and second mediasources may be two different types of media sources. In another suitableapproach, the first and second media sources may be of the same type.

Some embodiments may provide users with information corresponding to amedia source in response to received voice commands. Responses toinformation requests may be provided from any suitable media source andmay provide information for any other suitable media source. Forexample, a user may be provided with content from a first media source.Upon receiving a voice command indicating an information request forcontent from a second media source, the interactive media guidanceapplication may provide the user with audible information related to thecontent from the second source. In another suitable approach, theinteractive media guidance application may provide the user with visualinformation corresponding to the audible information related to thecontent. Thus, audio for the content from the first media source may bereplaced or augmented by audible information related to the content fromthe second media source in response to the received voice command. Inanother suitable approach, the volume of audio for the content from thefirst media source may be lowered or muted.

Some embodiments may record content from a media source in response toreceived voice commands. Recording may be performed from any suitablemedia source. For example, a user may be provided with content from afirst media source. Upon receiving a voice command indicating that theuser wishes to record content from a second media source, such as“record,” the content may be recorded to a desired recording device.This is an example of “one word record” functionality. A defaultrecording device (e.g., DVD recorder, PVR, VCR, CD recorder, or anyother suitable recording device) may be specified by the user at sometime prior to recording such that upon receiving the command “record,”the content from the second media source is automatically recorded tothe default device.

In another suitable approach, the device on which to record the contentfrom the second source may be specified by receiving an additionalcommand from the user, such as “PVR,” “VCR,” “DVD recorder,” “CDrecorder,” or any other command corresponding to a suitable recordingdevice. In yet another suitable approach, multiple users may be providedfor such that a file to record the content from the second source may bespecified by the user. For example, an audible question may be providedto the user, such as, “What file?” To this question, a user name such as“Dad” may be received from the user. Thus, the content may be recordedto Dad's file. In another suitable approach, a specific file to recordthe content from the second media source May be chosen based on voicerecognition. For example, a voice command to record may be received froma user. A voice recognition system may recognize the identity of theuser based on the user's voice, based on the microphone that receivedthe voice command, or based on any other suitable approach. The contentfrom the second media source may record to a file associated with theidentity of the user.

Some embodiments may play back previously-recorded media source contentin response to received voice commands. Playing back may be performedfrom any suitable media source content. For example, a user may beprovided with content from a first media source. Upon receiving a voicecommand indicating that the user wishes to play back previously-recordedcontent from a second media source, such as “play back,” thepreviously-recorded content may be played back from a desired recordingdevice. A default recording device (e.g., DVD recorder, PVR, VCR, CDrecorder, or any other suitable recording device) may be specified bythe user at some time prior to playing back such that upon receiving thecommand “play back,” the previously-recorded content is automaticallyplayed back from the default device.

In another suitable approach, the device from which to play back thepreviously-recorded content may be specified by receiving an additionalcommand from the user, such as “PVR,” “VCR,” “DVD recorder,” “CDrecorder,” or a command corresponding to any other suitable recordingdevice. In yet another suitable approach, multiple users may be providedfor such that a file from which to play back the previously-recordedcontent may be specified by the user. For example, an audible question,may be provided to the user, such as, “What file?” To this question, auser name such as “Dad” may be received from the user. Thus, the contentmay be played back from Dad's file. In another suitable approach, aspecific file from which to play back the content may be chosen based onvoice recognition. For example, a voice command to play back may bereceived from a user. A voice recognition system may recognize theidentity of the user based on the user's voice, based on the microphonethat received the voice command, or based on any other suitableapproach. The content may be played back from a file associated with theidentity of the user.

Some embodiments may receive selections of features for an audiblefavorites list from a user and may provide, the features of thefavorites list to the user in an audible scan. For example, a menu fromwhich features may be selected for an audible favorites list may beprovided to a user. Such a menu may include, for example, non-mediaoptions (e.g., sports, weather, news, stocks, traffic, or any othersuitable non-media option), media-related options (e.g., channels,programs, settings, reminders, advertisements, or any other suitablemedia-related option), or any other features suitable for inclusion inthe audible favorites list. The interactive media guidance applicationmay provide information related to the user-selected features in anaudible scan. This information may be provided to the userautomatically. In another suitable approach, this information may beprovided to the user in response to a user voice command requesting tolisten to the audible favorites. The audible information may be providedusing audio clips or files associated with the favorites features,streamed audio, audio generated from text, or using any other suitableapproach. The user may interact with the information provided in theaudible scan.

Some embodiments may perform features specific to a user based on theuser's identity. For example, a voice command from a user may bereceived by a microphone. A voice recognition system may determine theuser's identity. The voice recognition system may be able to identify auser by the sound of the voice giving the voice command, by themicrophone used to give the voice command (e.g., by associating aspecific user with a specific microphone for receiving voice commands),or by any other suitable approach for user identification. The identityof the user may be used to perform features specific to the user. Suchfeatures may include, for example, customized guide interfaces, targetedadvertisements, audience measuring, favorites, or any other suitableuser-specific feature.

Some embodiments may provide a user with targeted advertisements basedon the user's monitored actions. For example, actions of a user may bemonitored to determine advertisements appropriate for the user. A voicecommand from the user may be received by a microphone. A voicerecognition system may determine the user's identity. The voicerecognition system may be able to identify a user by the sound of thevoice giving the voice command, by the microphone used to give the voicecommand (e.g., by associating a specific user with a specific microphonefor receiving voice commands), or by any other, suitable approach foruser identification. The user may be provided with targetedadvertisements based on the monitored actions. In one suitable approach,targeted advertisements may be associated with a user for use in thefuture.

Some embodiments may determine the identity of a user and measureaudience information based on the user's profile. For example, a voicecommand from a user may be received by a microphone. A voice recognitionsystem may determine the user's identity. The voice recognition systemmay be able to identify a user by the sound of the voice giving thevoice command, by the microphone used to give the voice command (e.g.,by associating a specific user with a specific microphone for receivingvoice commands), or by any other suitable approach for useridentification. The user may be provided with content from a mediasource. Audience information for the media source content may bemeasured by using a user profile associated with the user. The userprofile may include demographic data such as, for example, age, gender,nationality, profession, income, location, or any other suitable data.The audience information may be measured using this data to create adetailed description of the audience.

Some embodiments may determine the identity of a user and may providethe user with favorites features based on the user's identity. Favoritesfeatures may include a user's favorite television channels, websites,audio channels, or any other suitable feature. For example, a voicecommand from a user may be received by a microphone. A voice recognitionsystem may determine the user's identity. The voice recognition systemmay be able to identify a User by the sound of the voice giving thevoice command, by the microphone used to give the voice command (e.g.,by associating a specific user with a specific microphone for receivingvoice commands), or by any other suitable approach for useridentification. The user may by provided with favorites features basedon the user's identity.

BRIEF DESCRIPTION OF THE DRAWINGS

The above and other objects and advantages of the invention will beapparent upon consideration of the following detailed description, takenin conjunction with the accompanying drawings, in which like referencecharacters refer to like parts throughout, and in which:

FIG. 1 is a schematic block diagram of an illustrative system inaccordance with one embodiment of the present invention;

FIGS. 2A-2E show illustrative arrangements for the interactive mediaguidance application equipment of FIG. 1 in accordance with severalembodiments of the present invention;

FIG. 3 is an illustrative schematic block diagram of the user televisionequipment of FIGS. 2A-2E in accordance with one embodiment of thepresent invention;

FIG. 4 is a generalized schematic block diagram of portions of theillustrative user television equipment of FIG. 3 in accordance with oneembodiment of the present invention;

FIGS. 5A-5B-show illustrative interactive displays in accordance withone embodiment of the present invention;

FIG. 6 is a flowchart of illustrative steps involved in source selectionin an interactive media guidance application in accordance with oneembodiment of the present invention;

FIGS. 7A-7B show illustrative interactive displays in accordance withone embodiment of the present invention;

FIG. 8 is a flowchart of illustrative steps involved in browsingoperations in interactive media guidance applications in accordance withone embodiment of the present invention;

FIG. 9 is a flowchart of illustrative steps involved in providingaudible information corresponding to a media source in accordance withone embodiment of the present invention;

FIG. 10 is a flowchart of illustrative steps involved in recordingcontent from a media source in accordance with one embodiment of thepresent invention;

FIG. 11 is a flowchart of illustrative steps involved in playing backcontent from a previously-recorded media source in accordance with oneembodiment of the present invention;

FIG. 12 shows an illustrative interactive display in accordance with oneembodiment of the present invention;

FIG. 13 shows another illustrative interactive display in accordancewith one embodiment of the present invention;

FIG. 14 is a flowchart of illustrative steps involved in receivingselections of features for an audible favorite's list from a user andproviding the features of the favorites list to the user in an audiblescan in accordance with one embodiment of the present invention;

FIG. 15 is a flowchart of illustrative steps involved in determining auser's identity and providing a feature specific to the user inaccordance with one embodiment of the present invention;

FIG. 16 is a flowchart of illustrative steps involved in determining auser's identity and providing targeted advertisements to the user inaccordance with one embodiment of the present invention;

FIG. 17 is a flowchart of illustrative steps involved in determining auser's identity and measuring an audience based on the user's profile inaccordance with one embodiment of the present invention; and

FIG. 18 is a flowchart of illustrative steps involved in determining auser's identity and providing a favorites feature in accordance with oneembodiment of the present invention.

DETAILED DESCRIPTION OF THE INVENTION

The speech interface for interactive media guidance applications of thepresent invention may be provided using any suitable hardware platform.Suitable platforms may include, for example, hardware such as satellitereceivers, personal computer televisions (PC/TVs), personal computers(e.g., with television tuner cards), cable set-top boxes, or any othersuitable hardware. One suitable interactive media guidance applicationis a television program guide. Illustrative interactive televisionprogram guide systems are described, for example, in Knee et al. U.S.Pat. No. 5,589,892 and Knudson et al. U.S. patent application Ser. No.09/357,941, filed Jul. 16, 1999, which are hereby incorporated byreference herein in their entireties. Client-server program guidesystems are described, for example, in Ellis et al: U.S. patentapplication Ser. No. 09/374,043, filed Aug. 13, 1999, which is herebyincorporated by reference herein in its entirety. On-line program guidesystems are described, for example, in Boyer et al. U.S. patentapplication Ser. No. 08/938,028, filed Sep. 18, 1997, which is herebyincorporated by reference herein in its entirety.

The interactive media guidance applications may allow users to recordmedia on digital or analog storage devices (e.g., videocassettes, harddisks, floppy discs, flash memory, recordable compact discs (CDs),recordable digital versatile discs (DVDs), or any other type ofstorage). Interactive program guides having digital storage aredescribed, for example, in Hassell et al. U.S. patent application Ser.No. 09/157,256, filed Sep. 17, 1998, which is hereby incorporated byreference herein in its entirety. Recording of media can also beperformed by a media guidance application or other server. Client-serverbased program guides with remote server recording are described, forexample, in Ellis et al. U.S. patent application Ser. No. 09/332,244,filed Jun. 11, 1999, which is hereby incorporated by reference herein inits entirety. On-line media guidance applications may also record mediaor direct a user's equipment to record media.

An illustrative system 100 in accordance with one embodiment of thepresent invention is shown in FIG. 1. Main facility 120 providesguidance application data from guidance application data source 160 tointeractive media guidance application equipment 130 via communicationslink 110. There may be multiple guidance application data sources butonly one has been shown to avoid over-complicating the drawing. Ifdesired, guidance application data sources may be located at facilitiesseparate from main facility 120, such as at local information service.150, and have their data provided to main facility 120 for localizationand distribution. Guidance application data-source 160 may be anysuitable computer or computer-based system for obtaining data (e.g.,manually from an operator, electronically via a computer network orother connection, or via storage media) and putting the data intoelectronic form for distribution by main facility 120. Link 110 may be asatellite link, a telephone network link, a cable or fiber optic link, amicrowave link, an Internet link, a combination of such links, or anyother suitable communications link. Video signals may also betransmitted over link 110 if desired.

Local information service 150 may be any suitable facility for obtainingdata particular to a localized region and providing the data to mainfacility 120 over communications link 140. Local information service 150may be, for example, a local weather station that measures weather data,a local newspaper that obtains local high school and college sportinginformation, or any other suitable provider of information. Localinformation server 150 may be a local business with a computer forproviding main facility 120, with, for example, local ski report,fishing conditions, menus, or any other suitable provider ofinformation. Link 140 may be a satellite link, a telephone network link,a cable or fiber optic link, a microwave link, an Internet link, acombination of such links, or any other suitable communications link.

The guidance application data transmitted by main facility 120 tointeractive media guidance application equipment 130 may includeprogramming data (e.g., program identifiers, times, channels or otherindicators of source, program titles, and program descriptions) andother data for services other than program listings (e.g., help text,pay-per-view information, weather information, sports information, musicchannel information, associated. Internet web links, associatedsoftware, or any other suitable information). There are preferablynumerous pieces or installations of interactive media guidanceapplication equipment 130, although only one is shown in FIG. 1 to avoidover-complicating the drawing.

Guidance application data may be transmitted by main facility 120 tointeractive media guidance application equipment 130 using any suitableapproach. Data files may, for example, be encapsulated as objectstransmitted using a suitable Internet based addressing scheme andprotocol stack (e.g., a stack which uses the user datagram protocol(UDP) and Internet protocol (IP)).

An interactive media guidance application is implemented, on interactivemedia guidance application equipment 130. Five illustrative arrangementsfor interactive media guidance application equipment 130 are shown inFIGS. 2A-2E. As shown, interactive media guidance application equipment130 may include distribution equipment 170 located at distributionfacility 180, and user television equipment 200 or personal computer240.

The interactive media guidance application may run totally on usertelevision equipment 200 using the arrangements of FIGS. 2A and 2C, ormay run partially on user television equipment 200 and partially oninteractive media guidance application equipment 130 using a suitableclient-server or distributed processing arrangement such as those shownin FIGS. 2B and 2D. Distribution facility 180 may be any suitabledistribution facility (e.g., a cable system headend, a broadcastdistribution facility, or any other suitable type of distributionfacility), and may have distribution equipment 170.

Distribution equipment 170 of FIGS. 2A, 2B, 2C, and 2D is equipmentsuitable for providing media guidance application data to usertelevision equipment 200 over communications path 190. In FIG. 2E,distribution equipment 170 may provide guidance application data toInternet service system 220 via, for example, a suitable computernetwork or Internet link. Distribution equipment 170 may include, forexample, suitable transmission hardware for distributing guidanceapplication data on a television channel sideband, in the verticalblanking interval (VBI) of a television channel, using an in-banddigital signal, using an out-of-band digital signal, or by any othersuitable data transmission technique. Analog or digital video signals(e.g., television programs) may also be distributed by distributionequipment 170 to user-television equipment 200 over communications paths190 on multiple television channels. In another suitable approach,videos may be distributed to user television equipment 200 from someother suitable distribution facility, such as a cable system headend, abroadcast distribution facility, a satellite television distributionfacility, or any other suitable type of television distributionfacility.

Communications paths 190 may be any communications paths suitable fordistributing guidance application data. Communications paths 190 mayinclude, for example, a satellite link, a telephone network link, acable or fiber optic link, a microwave link, an Internet link, adata-over-cable service interface specification (DOCSIS) link, acombination of such links, or any other suitable communications link.Communications paths 190 may have sufficient bandwidth to allowdistribution facility 180 or another distribution facility to distributeprogramming to user television equipment 200. There are typicallymultiple pieces of user television equipment 200 and multiple associatedcommunications paths 190, although only one piece of user televisionequipment 200 and communications path 190 are shown in FIGS. 2A-2D toavoid over-complicating the drawings. If desired, programming andguidance application data may be provided over separate communicationspaths.

FIG. 2B shows an illustrative arrangement for interactive media guidanceapplication equipment 130 in a client-server based or distributedinteractive media guidance application system. As shown in FIG. 2B,distribution equipment 170 may include guide server 210. Guide server210 may use any suitable combination of hardware and software to providea client-server based media guidance application. Guide server 210 may,for example, run a suitable, database engine (e.g., SQL Server byMicrosoft) and provide guidance application data in response to queriesgenerated by a guide client implemented on user television equipment200. If desired, guide server 210 may be located at main facility 120 orat another location, such as a cable system headend, a broadcastdistribution facility, a satellite television distribution facility, orany other suitable type of television distribution facility.

The media guidance application may retrieve guidance application datafrom guide server 210 using any suitable client-server based approach.The media guidance application may, for example, pass SQL requests asmessages to guide server 210. In another suitable approach, the mediaguidance application may invoke remote procedures that reside on guideserver 210 using one or more remote procedure calls. Guide server 210may execute SQL statements for such invoked remote procedures. In stillanother suitable approach, client objects executed by the media guidanceapplication may communicate with server objects executed by guide server210 using, for example, an object request broker (ORB). This may involveusing, for example, Microsoft's Distributed Component. Object Model(DCOM) approach.

The interactive media guidance application implemented on interactivemedia guidance application equipment 130 may communicate with guideserver 210 over communications path 190 using any suitable network andtransport layer protocols, if desired. They may communicate, forexample, using a protocol stack which includes Sequenced PacketExchange/Internetwork Packet Exchange (SPX/IPX) layers, TransmissionControl Protocol/Internet Protocol (TCP/IP) layers, AppletalkTransaction Protocol/Datagram Delivery Protocol (ATP/DDP) layers, DOCSISor any other suitable network and transport layer protocols.

FIGS. 2C and 2D show illustrative Internet-based interactive mediaguidance application systems. Distribution facility 180 may include, forexample, Internet service system 220. Internet service system 220 mayuse any suitable combination of hardware and software capable ofproviding guidance application data to the guide using an Internet basedapproach (e.g., the HyperText Transfer Protocol (HTTP)). If desired,Internet service system 220 may be located at a facility that isseparate from distribution facility 180.

In embodiments where the interactive media guidance application isimplemented on user television equipment 200 of interactive programguide television equipment 130 as shown in FIG. 2C, Internet servicesystem 220 (or other suitable equipment at distribution facility 180that is connected to Internet service system 220) may provide guidanceapplication data to user television equipment 200 via the Internet, orvia distribution equipment 170 using any suitable Internet-basedapproach (e.g., using the HyperText Transfer Protocol (HTTP) over aTransmission Control Protocol/Internet Protocol (TCP/IP) type link). Ifthe media guidance application implemented on interactive media guidanceapplication equipment 130 is a client-server guide as shown in FIG. 2D,guide server 210 may obtain guidance application data from Internetservice system 220. The media guidance application may also, however,obtain guidance application data from Internet service system 220 via anInternet connection.

In another suitable arrangement, distribution equipment 170 may includecomputer equipment or other suitable hardware on which a first portionor version of the interactive media guidance application is implemented.A second portion or version of the interactive media guidanceapplication may be implemented on user television equipment 200. The twoversions or portions of the interactive media guidance application maycommunicate using any suitable peer-to-peer communications scheme (e.g.,messaging, remote procedure calls, or any other suitable communicationsscheme) and perform interactive media guidance application functionsdistributively between distribution facility 180 and user televisionequipment 200.

Another suitable arrangement in which an on-line media guidanceapplication is implemented on interactive media guidance applicationequipment 130 is shown in FIG. 2E. On-line program guide systems aredescribed, for example, in above-mentioned Boyer et al. U.S. patentapplication Ser. No. 08/938,028, filed Sep. 18, 1997. The user may havepersonal computer (PC) 240 on which a media guidance application clientor web browser is implemented. Personal computer 240 may be connected toInternet service system 220 via Internet link 230. Internet servicesystem 220 may use any suitable combination of computer hardware andsoftware capable, of providing an on-line media guidance applicationserver application or website. Internet service system 220 is shown asobtaining, guidance application data from distribution facility 180. Inother suitable approaches, Internet service system 220 may obtaininformation from other systems such as, for example, main facility 120,local information service 150, or any other suitable source of guidanceapplication data.

An illustrative arrangement for user television equipment 200 is shownin FIG. 3. User television equipment 200 of FIG. 3 receives video or adigital video stream and data from distribution facility 180 (FIG. 2A),or some other suitable distribution facility, at input 250. Duringnormal television viewing, a user may tune set-top box 260 to a desiredchannel. The signal for that channel is then provided at video output270. The signal supplied at output 270 is typically either aradio-frequency (RF) signal on a predefined channel (e.g., channel 3 or4), or an analog demodulated video signal, but may also be a digitalsignal provided to television 280 on an appropriate digital bus (e.g., abus using the Institute of Electrical and Electronics Engineers (IEEE)1394 standard, (not shown)). The video signal at output 270 is receivedby optional secondary storage device 290.

The interactive media guidance application may run on set-top box 260,on television 280 (if television 280 has suitable processing circuitryand memory), on a suitable analog or digital receiver connected totelevision 280, or on digital storage device 300 (if digital storagedevice 300 has suitable processing circuitry and memory). Theinteractive media guidance application may also run cooperatively on asuitable combination of these devices. Interactive televisionapplication systems in which a cooperative interactive televisionprogram guide application runs on multiple devices are described, forexample, in Ellis U.S. patent application Ser. No. 09/186,598, filedNov. 5, 1998, which is hereby incorporated by reference herein in itsentirety.

Secondary storage device 290 can be any suitable type of analog ordigital program storage device (e.g., a videocassette recorder (VCR), apersonal video recorder (PVR), a digital versatile disc (DVD) recorder,or any other suitable device). Program recording and other features maybe controlled by set-top box 260 using control path 310. If secondarystorage device 290 is a VCR or a PVR, for example, a typical controlpath 310 involves the use of an infrared transmitter coupled to theinfrared receiver in the recorder that normally accepts commands from aremote control such as remote control 320. Remote control 320 may beused to control set-top box 260, secondary storage device 290, andtelevision 280. Audio input device 325 may be used to control set-topbox 260. Audio input device 325 may be a microphone or any othersuitable device for receiving audio input. Audio input device 325 may beincluded in remote control 320 if desired.

If desired, a user may record media such as programs, guidanceapplication data, or a combination thereof in digital form on optionaldigital storage device 300. Digital storage device 300 may be awriteable optical storage device (such as a DVD recorder capable ofhandling recordable DVD discs), a magnetic storage device (such as adisk drive or digital tape), or any other digital storage device.Interactive television program guide systems that have digital storagedevices are described, for example, in Hassell et al. U.S. patentapplication Ser. No. 09/157,256, filed Sep. 17, 1998, which is herebyincorporated by reference herein in its entirety.

Digital storage device 300 can be contained in set-top box 260 or it canbe an external device connected to set-top box 260 via an output portand appropriate interface. If necessary, processing circuitry in set-topbox 260 formats the received video, audio and data signals into adigital file format. The file format may be an open file format such asthe Moving Picture Experts Group (MPEG) MPEG-2 standard or the MovingJoint Photographic Experts Group (MJPEG) standard. The resulting data isstreamed to digital storage device 300 via an appropriate bus (e.g., abus using the Institute Electrical and Electronics Engineers (IEEE) 1394standard), and is stored on digital storage device 300. In anothersuitable approach, an MPEG-2 data stream or series of files may bereceived from distribution equipment 170 and stored.

Television 280 receives video signals from secondary storage device 290via communications path 330. The video signals on communications path330 may either be generated by secondary storage device 290 when playingback prerecorded storage media (e.g., a videocassette or a recordableDVD), by digital storage device 300 when playing back pre-recordeddigital media, may be passed through from set-top box 260, may beprovided directly to television 280 from set-top box 260 if secondarystorage device 290 is not included in user-television equipment 200, ormay be received directly by television 280. During normal televisionviewing, the video signals provided to television 280 correspond to thedesired channel to which a user has tuned with set-top box 260. Videosignals may also be provided to television 280 by set-top box 260 whenset-top box 260 is used to play back information stored on digitalstorage device 300.

Set-top box 260 may have memory 340. Memory 340 may be any memory orother storage device, such as a random access memory (RAM), read onlymemory (ROM), flash memory, a hard disk drive, a combination of suchdevices, or any other suitable storage device, that is suitable forstoring media guidance application instructions and guidance applicationdata for use by the media guidance application.

Set-top box 260 may have communications device 350 for communicatingdirectly with distribution equipment 170, guide server 210 or Internetservice system 220 over communications path 190. Communications device350 may be a modem (e.g., any suitable analog or digital standard,cellular, or cable modem), network interface card (e.g., an Ethernetcard, Token ring card, or any other suitable network interface card),any other suitable communications device, or a combination of suchdevices. Communications device 350 may be a personal computer with anInternet connection in, for example, the arrangement shown in FIGS. 2Cand 2D. Television 280 may also have such a suitable communicationsdevice if desired. In another suitable approach, user televisionequipment 200 may communicate with Internet service system 220 viadistribution equipment 170 using a suitable return path.

A more generalized embodiment of user television equipment 200 of FIG. 3is shown in FIG. 4. As shown in FIG. 4, guidance application data fromdistribution facility 180 (FIG. 2A) is received by processing circuitry360 of user television equipment 200. Processing circuitry 360 mayinclude any suitable processor, such as a microprocessor or group ofmicroprocessors (e.g., an Intel Pentium microprocessor, an IBM PowerPCmicroprocessor, or any other suitable microprocessor), and otherprocessing circuitry such as caching circuitry, video decodingcircuitry, direct memory access (DMA) circuitry, input/output (I/O)circuitry, any other suitable processing circuitry, or a combination ofsuch circuitry. The functions of processing circuitry 360 may beprovided using the set-top box arrangement of FIG. 3. In anothersuitable approach, these functions may be integrated into an advancedtelevision receiver, personal computer television (PC/TV), or any othersuitable arrangement. If desired, a combination of such arrangements maybe used.

User television equipment 200 may have secondary storage device 370 anddigital storage device 380 for recording media. Secondary storage device370 can be any-suitable type of analog or digital program storage device(e.g., a videocassette recorder (VCR), a personal video recorder (PVR),a digital versatile disc (DVD) recorder, or any other suitable device).Program recording and other features may be controlled by processingcircuitry 360. Digital storage device 380 may be, for example, awriteable optical storage device (such as a DVD recorder capable ofhandling recordable DVD discs) a magnetic storage device (such as a diskdrive or digital tape), or any other digital storage device.

User television equipment 200 may have memory 390. Memory 390 may be anymemory or other storage device, such as a random access memory (RAM),read only memory (ROM), flash memory, a hard disk drive, any othersuitable device, or a combination of such devices, that is suitable forstoring media guidance application instructions and guidance applicationdata for use by processing circuitry 360.

User television equipment 200 of FIG. 4 may have communications device400 for supporting communications between the interactive media guidanceapplication and distribution equipment 170, guide server 210, orInternet service system 220 via communications path 190. Communicationsdevice 400 may be a modem (e.g., any suitable analog or digitalstandard, cellular, or cable modem), network interface card (e.g., anEthernet card, Token ring card, or any other suitable network interfacecard), or other suitable communications device.

A user may control the operation of user television equipment 200 withuser input device 410. User input device 410 may be a pointing device,wireless remote control, keyboard, touch-pad, microphone, or any othersuitable user input device. In one suitable approach, a microphone mayreceive voice commands from a user, which may be processed by voicerecognition system 435. The microphone may be wired or wireless. In thecase of just one microphone, it may be located in a communal place in aroom. However, the microphone may be included in a remote controldevice, a set-top box, or any other suitable location. A remote controldevice with a microphone may include a button for the user to press toalert processing circuitry 360 that the user desires to give a voicecommand. Such a button may be held by the user while giving the voicecommand. In another suitable approach, the user may press and releasethe button prior to giving the voice command.

Voice recognition system 435 may be included separate from otherequipment, in processing circuitry 360, in guide server 210 or otherequipment at a headend, or in any other suitable location. Voicerecognition system 435 may receive voice commands via a microphone,(e.g., user input device 410). Voice recognition system 435 may includemedia cancellation circuitry that may monitor the audio output of usertelevision equipment 200 or personal computer 240. This audio output maybe subtracted from input to the microphone which may be, for example,voice commands from a user. Thus, user voice commands may be distinctfrom audio output of user television equipment 200 or personal computer240. Voice recognition system 435 may be able to determine a user'sidentity. Voice recognition system 435 may make such an identificationby the sound of the voice giving the voice command, by the input deviceused to give the voice command (e.g., by associating a user with aspecific microphone for receiving voice commands), or by any othersuitable approach for user identification.

Audible output related to an interactive media guidance application maybe provided by one or more speakers 415. Speaker 415 may be separatefrom display device 420. In another suitable approach, speaker 415 maybe integrated into display device 420.

To watch television, for example, a user may instruct processingcircuitry 360 to display a desired television channel on display device420. Display device 420 may be any suitable television, monitor, orother suitable display device. To access the functions of theinteractive media guidance application, a user may instruct theinteractive media guidance application implemented on interactive mediaguidance application equipment 130 to generate a main menu or otherdesired guide display screen for display on display device 420.Alternate display device 425 may be provided to display guide displayscreens such as menus, browse displays, or any other suitable screen.Alternate display device 425 may be a hand-held device (e.g., a remotecontrol), a monitor, or any other suitable device. Alternate displaydevice 425 may be used for guide display screens thereby allowing theentire viewing area of display device 420 to be used for programming.

A user interface with speech recognition is provided to allow users tonavigate within interactive media guidance applications, such astelevision program guides, guides for audio services, guides forvideo-on-demand (VOD) services, guides for personal video recorders(PVRs), or any other suitable guidance applications. Such an interfacemay provide hands-free control over some features of such media guidanceapplications. A device such as a wired or wireless microphone may beused to receive guidance voice commands from a user. For clarity, thepresent invention will be described in terms of a microphone as the userinterface device which is used to receive a user's voice commands.However, any other device suitable for receiving a user's voice commandsmay be used, if desired. For example, a remote access device may receivevoice commands. Interactive guide systems and remote access devices forsuch systems that have speech interfaces are described, for example, inEllis et al. U.S. patent application Ser. No. 09/354,344, filed Jul. 16,1999, which is hereby incorporated by reference herein in its entirety.

Some embodiments may receive voice commands from a user and may performguidance activities in response to the voice commands. Such guidanceactivities may include, for example, source selection, browsing,obtaining media information, recording, playing back, or any othersuitable guidance activity. Guidance activities may be performed onmedia sources which may be provided by interactive media guidanceapplications. Such media sources may include, for example, televisionchannels, audio channels, video-on-demand content, recorded content(e.g., content recorded to a PVR, VCR, CD recorder, DVD recorder, or anyother suitable recording device), Internet-delivered content (e.g.,websites, video, or any other suitable Internet-delivered content), orany other suitable media source. Media source content may include menus,browse displays, or any other suitable display associated with a mediasource.

Some embodiments may select media sources in response to received voicecommands. Source selection may be performed, from any suitable mediasource to select any other suitable media source. For example, a usermay be provided with content from a first media source, such as atelevision program. Upon receiving a voice command indicating contentfrom a user-selected second media source, such as media recorded to aPVR, the interactive media guidance application may perform the sourceselection and may provide the user with the content from the secondsource. Thus, the television program, which is the first media sourcecontent, may be replaced or augmented by recorded media from the PVR,which is the second media source content, in response to the receivedvoice command. In another example, a user may be provided with, forexample, Internet-delivered media. Upon receiving a voice commandindicating a user selection of an audio channel, the interactive mediaguidance application may perform the source selection and provide theuser with the specified audio channel.

Source selection may be performed across different sources. FIGS. 5A and5B show illustrative displays for source selection from content from afirst media source, a television program, to content from a second mediasource, an Internet-delivered application, in accordance with oneembodiment of the invention. Display 500 of FIG. 5A may include contentfrom a first media source which may be, for example, a televisionprogram. Overlay 570 may be superimposed on display 500. In anothersuitable approach, overlay 570 may be provided on a display deviceseparate from the device that includes display 500. For example, overlay570 may be provided on an alternate display device, such as alternatedisplay device 425 (shown in FIG. 4). Alternate display device 425 maybe a hand-held device (e.g., a remote control), a monitor, or any othersuitable device. Overlay 570 may include an advertisement 510, aninformation region 530, a provider logo 540, and a clock 550.Information, region 530 may further include a source identifier 520.Source identifier 520 indicates that the media source content currentlyprovided to the user is a television program on Channel 7, WBAC.

To select content from a second media source, for example anInternet-delivered application, a voice command from a user, such as“tvguide.com,” may be received by a microphone. After receiving thevoice command, the interactive media guidance application may displaythe user-selected website, for example, in FIG. SB. As shown in FIG. 58,display 550 may include an Internet-delivered application. Sourceidentifier 520 indicates that the media source content currentlyprovided to the user is a website, specifically, www.tvguide.com. Asshown in source identifier 520, www.tvguide.com is website “1 of 100,”illustrating that the website is one of 100 possible websites forselection as source content. Such websites may be determined by theinteractive media guidance application, or they may be specified by auser. In another suitable approach, a voice command from a user such as“surf web” may be received to launch a web browser to the user'shomepage from the television program (e.g., the first media source). Avoice command such as “surf tvguide.com” may be received from the userto go to the tvguide.com website. From the website, a voice command suchas “click listings” may be received from the user to simulate the actionof Clicking on a link labeled “listings” by the user. The Internetapplication may provide the user with suitable listings in response tothe received voice command. Overlay 570, which may indicate the currentmedia source content, may be removed from display 560 after a fewseconds. In another suitable approach, overlay. 570 may be provided on adisplay device separate from the device that includes display 560. Forexample, overlay 570 may be provided on an alternate display device,such as alternate display device 425 (shown in FIG. 4). Alternatedisplay device 425 may be a hand-held device (e.g., a remote control), amonitor, or any other suitable device. Thus, by receiving a suitablevoice command from a user that specifies a selection of content from asource type differing from the current source type, some embodiments mayprovide the user with content from the new source, thereby demonstratinga Source selection-guidance activity.

The example of FIG. 5 illustrates content from a second source providedto a user from content from a first source. In another suitableapproach, some feature of content from a second source (e.g., listings,reminders, favorites, or any other suitable feature) may be provided toa user from content from a first source. The example in FIG. 5 is shownin the context of a television channel and an Internet-deliveredapplication. Source selection may be utilized with other media sourcessuch as audio channels, video-on-demand content, recorded content, orany other suitable media source. In one suitable approach, the first andsecond media sources may be two different types of media sources. Inanother suitable approach, the first and second media sources may be ofthe same type. For example, to access a second television channel from afirst television channel, a voice command from a user may be received.Such a voice command may include “channel up,” “channel down,” “lastchannel” (e.g., to access a previous channel), “next favorite channel”(e.g., to access a favorite channel), “HBO” (e.g., to access a channelby channel name), “watch ‘Friends’” (e.g., to access a specifictelevision program), or any other suitable voice command. In response toreceiving the voice command, the second television channel may beprovided.

A flowchart of illustrative steps involved in source selection in aninteractive media guidance application is shown in FIG. 6. Content froma first media source (e.g., television channels, audio channels,video-on-demand content, recorded content (e.g., content recorded to aPVR, VCR, CD recorder, DVD recorder, or any other suitable recordingdevice), Internet-delivered content (e.g., websites, video, or any othersuitable Internet-delivered content), or any other suitable mediasource) may be provided to a user at step 610. An example of this stepis shown in display 500 of FIG. 5A. After content from the first sourceis provided to the user, a voice command from the user may be receivedindicating a second source (e.g., television channels, audio channels,video-on-demand content, recorded content (e.g., content recorded to aPVR, VCR, CD recorder, DVD recorder, or any other suitable recordingdevice), Internet-delivered content (e.g., websites, video, or any othersuitable Internet-delivered content), or any other suitable mediasource) at step 620. The second source may be a different type, of mediasource than the first source (e.g., the first source is a televisionchannel and the second source is an Internet application). In anothersuitable approach, the second source may be the same type of mediasource as the first source (e.g., the first and second sources are bothtelevision channels). Content from the second source may be provided tothe user at step 630. An example of this step is shown in display 560 ofFIG. 5B. The above example is merely illustrative, and any suitablesteps may be employed in source selection in a media guidanceapplication, if desired.

Some embodiments may perform browsing operations in response to receivedvoice commands. Browsing operations may be performed from any suitablemedia source to browse media available for any suitable media source.For example, a user may be provided with content from a first mediasource, such as a television program. Upon receiving a voice commandindicating content from a second media source for browsing, such ascontent recorded to a PVR, the interactive media guidance applicationmay provide the user with a browse display for the content from thesecond source. Such a browse display may contain a list of contentrecorded to the PVR. Thus, in response to the received voice command, abrowse display containing a list of recorded PVR content (which is thesecond media source) may be provided as an overlay on the televisionprogram (which is the content from the first media source). In anothersuitable approach, the television program video may be reduced in sizeso that the browse overlay does, not obscure any of it. In yet anothersuitable approach, the overlay may be provided on a display device thatis separate from the display device that includes the first sourcecontent. In another example of browsing operations, a user may beprovided with an Internet-delivered application. Upon receiving a voicecommand indicating that the user wishes to browse a list of audiochannels, the interactive media guidance application may provide theuser with a browse display for audio channels as an overlay on theInternet delivered application. In another suitable approach, theoverlay may be provided on a display device that is separate from thedisplay device that includes the first source content.

Browsing operations may be performed across different types of sources.FIGS. 7A and 7B show illustrative displays for browsing from contentfrom a first media source, recorded content from a PVR, for content froma second media source, television channel listings, in accordance withone embodiment of the invention. Display 800 of FIG. 7A may includecontent from a first media source which may be, for example, recordedcontent from a PVR. To browse for content from a second media source,for example, television channel listings, a voice command from a user,such as “browse television channels,” may be received. After receivingthe voice command, the interactive media guidance application maydisplay a browse overlay as shown, for example, in FIG. 7B.

As shown in FIG. 78, display 870 may include recorded content from thePVR (e.g., the first source). Display 870 may include browse overlay820. In another suitable approach, overlay 824 may be provided on adisplay device that is separate from the display device that includesdisplay 870. For example, overlay 820 may be provided on an alternatedisplay device, such as alternate display device 425 (shown in FIG. 4).Alternate display device 425 may be a hand-held device (e.g., a remotecontrol), a monitor, or any other suitable device overlay 820 mayinclude television channel listings for browsing in browse region 830.Up and down arrows 880 and right and left arrows 890 may be provided tofacilitate browsing up and down to other television channels and leftand right to other time slots, respectively. In another suitableapproach, voice commands from the user may be received to browse toother television channels and time slots. Voice commands to browse toother time slots and television channels may include, for example, “up,”“down,” “later,” “earlier,” “channel 8” (e.g., to view the listings fora specific channel number), “HBO” (e.g., to view the listings for aspecific channel name), or any other suitable voice command. Voicecommands may be received from the user to perform actions from thebrowse display such as, for example, “watch” (e.g., to tune to thechannel showing the currently-selected program), “select ad” (e.g., toselect an advertisement on the browse display), “info” (e.g., to obtaininformation on a listing), or any other suitable voice command. Overlay820 may also include an advertisement 810, a provider logo 860, and aclock 850. Thus, by receiving a suitable voice command from a user thatspecifies source content for browsing, some embodiments may provide theuser with a browse overlay containing a list of content for the secondsource, thereby demonstrating a browsing operation.

While the example in FIG. 7 is shown in the context of recorded contentfrom a PVR and television channels, browsing-operations' may be utilizedwith other types of media sources such as audio channels,video-on-demand content, Internet-delivered content, or any othersuitable media source. In one suitable approach the first and secondmedia sources may be two different types of media sources. In anothersuitable approach, the first and second media sources may be of the sametype. For example, to browse video-on-demand listings (e.g., the secondsource) from a video-on-demand (e.g., the first source), a voice commandfrom a user may be received. Such a voice command may be “browse VOD.”In response to receiving the voice command, a browse display thatincludes video-on-demand listings may be provided to the user. Voicecommands may be received while browsing video-on-demand to performactions from the video-on-demand browse display, such as, for example,“next movie,” “buy,” or any other suitable command.

A flowchart of illustrative steps involved in browsing operations ininteractive media guidance applications is shown in FIG. 8. Content froma first media source (e.g., television channels, audio channels,video-on-demand content, recorded content (e.g., content recorded to aPVR, VCR, CO recorder, DVD recorder, or any other suitable recordingdevice), Internet-delivered content (e.g., websites, video, or any othersuitable Internet-delivered content), or any other suitable mediasource) may be provided to a user at step 910. An example of this stepis shown in display 800 of FIG. 7A. After content from the first mediasource is provided to the user, a voice command may be received from theuser indicating a second media source for a browsing operation (e.g.,television channels, audio channels, video-on-demand content, recordedcontent (e.g., content recorded to a PVR, VCR, CD recorder, DVDrecorder, or any other suitable recording device), Internet-deliveredcontent (e.g., websites, video, or any other suitable Internet-deliveredcontent), or any other suitable media source) at step 920. A browseoverlay corresponding to content from the second media source may beprovided to the user at step 930. An example of this step is shown indisplay 870 of FIG. 7B. The above example is merely illustrative, andany suitable steps may be employed in a browsing operation for a mediaguidance application, if desired.

Some embodiments may provide users with information corresponding to amedia source in response to received voice commands. Responses toinformation requests may be provided from any suitable media source andmay provide information for any suitable media source. For example, auser may be provided with content from a first media source, such as atelevision program. Upon receiving a voice command indicating aninformation request for content from a second media source, such ascontent recorded to a PVR, the interactive media guidance applicationmay provide the user with audible information related to the content. Inanother suitable approach, the interactive media guidance applicationmay provide the user with visual information related to the content fromthe second media source. Such information may include, for example, thenumber of files saved on the PVR, the name of the most recently-recordedfile, or any other suitable information related to content recorded tothe PVR. Thus, audio for the television program, which is the contentfrom the first media source, may be replaced or augmented by audibleinformation related to recorded content from the PVR, which is thecontent from the second media source, in response to the received voicecommand. In another suitable approach, the volume of audio for thetelevision program may be lowered or muted.

In another example of providing a user with information, a user may beprovided with, for example, an Internet-delivered application. Uponreceiving a voice command indicating a request for information for anaudio channel, the interactive media guidance application may performthe information request and provide the user with audible informationrelated to the audio channel. Such audible information may replace oraugment any audio associated with the Internet-delivered application. Inanother suitable approach, the volume of audio for theInternet-delivered application may be lowered or muted.

Audible information may be provided across different sources. Aflowchart of illustrative steps involved in providing audibleinformation corresponding to a media source is shown in FIG. 9. Contentfrom a first media source (e.g., television channels, audio channels,video-on-demand content, recorded content (e.g., content recorded to aPVR, VCR, CD recorder, DVD recorder, or any other suitable recordingdevice), Internet-delivered content websites, video, or any othersuitable Internet-delivered content), or any other suitable mediasource) may be provided to a user at step 1010. After content from thefirst media source is provided to the user, a voice command may bereceived from the user indicating that the user wishes to receiveinformation corresponding to a second media source (e.g., televisionchannels, audio channels, video-on-demand content, recorded content(e.g., content recorded to a PVR, VCR, CD recorder, DVD recorder, or anyother suitable recording device), Internet-delivered content (e.g.,websites, video, or any other suitable Internet-delivered content), orany other suitable media source) at step 1020. Such a voice command maybe, for example, “info, PVR,” corresponding to a request for informationfor content recorded to a. PVR. Another illustrative voice command maybe “info, audio channels,” corresponding to a request for informationfor audio channels. Yet another illustrative voice command may be “info,HBO,” corresponding to a request for information for a specifictelevision channel (e.g., HBO). Similarly, a voice command for any othersuitable media source may be received. Audible information for contentfrom the second media source may be provided to the user at step 1030.This may be performed using audio clips or files associated with theguidance application data, streamed audio, audio generated from text, orusing any other suitable approach. The above example is merelyillustrative, and any suitable steps may be employed in order to provideaudible information for a source, if desired.

Some embodiments may record content from a media source in response toreceived voice commands. Recording may be performed from any suitablemedia source. For example, a user may be provided with content from afirst media source. Upon receiving a voice command indicating that theuser wishes to record content from a second media source, such as“record,” the content may be recorded to a desired recording device. Inanother suitable approach, source content may be recorded from anassociated browse screen, listing screen, information screen or anyother suitable associated screen. In yet another suitable approach, avoice command specifying a program or a channel from which to recordcontent may be received, for example, “record ‘Seinfeld.’” This is anexample of “one-word record” functionality. A default recording device(e.g., DVD recorder, PVR, VCR, CD recorder, or any other suitablerecording device) may be specified by the user at some time prior torecording such that upon receiving the command “record,” the content ofthe second media source is automatically recorded to the default device.

In another suitable approach, the device on which to record the contentfrom the second media source may be specified by receiving an additionalcommand from the user, such as “PVR,” “VCR,” “DVD recorder,” “CDrecorder,” or any other command corresponding to a suitable recordingdevice. In yet another suitable approach, multiple users may be providedfor such that a file to record the content may be specified by the user.For example, an audible question may be provided to the user, such as,“What file?” To this question, a user name such as “Dad” may be receivedfrom the user. Thus, the content may be recorded to Dad's file. In yetanother suitable approach, a specific file to record content from asecond media source may be chosen based on voice recognition. Forexample, a voice command to record may be received from a user. A voicerecognition system, such as voice recognition system 435 of FIG. 4, maydetermine the user's identity based on the user's voice, based on themicrophone that received the voice command, or based on any othersuitable approach. The content of the second media source may record toa file associated with the identity of the user.

In an example of recording media source content in response to receivedvoice commands, an audio channel may be provided to a user. A Voicecommand such as “record” may be received indicating, for example, thatthe user wishes to record the content of the audio channel. This contentmay be recorded to a default device. However, an additional command maybe received indicating a user-specified device on which to record thecontent of the audio channel. For example, a command such as “CDrecorder” may be received. In another example, an Internet-deliveredapplication, such as a website, may be provided to a user. A voicecommand such as “record” may be received indicating, for example, thatthe user wishes to save the link to the website. This link may be saved,for example, in a directory of websites in a suitable media guidanceapplication. The media source content, in this example the link, may berecorded to a general file for use by all users. However, an additionalcommand may be received indicating a user-specified file in which tosave the link. For example, a command such as “Lisa's file” may bereceived, indicating that Lisa's file is the desired destination for thelink. Voice commands may be received from a user to specify recordingparameters such as start time, end time, recording quality, or any othersuitable recording parameter.

Sources may be recorded in response to received voice-commands from auser. A flowchart of illustrative steps involved in recording mediasource content is shown in FIG. 10. Content from a first media source(e.g., television channels, audio channels, video-on-demand content,recorded content (e.g., content recorded to a PVR, VCR, CD recorder, DVDrecorder, or any other suitable recording device), Internet-deliveredcontent (e.g., websites, video, or any other suitable Internet-deliveredcontent), or any other suitable media source) may be provided to a userat step 1110. After the content from the first media source is providedto the user, a voice command may be received from the user indicatingthat the user wishes to record content from a second media-source atstep 1120. Such a voice command may be, for example, “record.” Inanother suitable approach, the content from the second media source maybe recorded from an associated browse screen, listing screen,information screen, or any other suitable associated screen. In onesuitable approach, the second source may be the same type of mediasource as the first source (e.g., the first and second sources are bothtelevision channels). In another suitable approach, the first and secondsources may be different types of media sources (e.g., the first sourceis a television channel and the second source is an audio channel). Avoice command may be received from the user specifying a device to whichthe content should record at step 1130. Such a voice command may be, forexample, “DVD recorder,” “CD recorder,” “VCR,” “PVR,” or any othersuitable command corresponding to a recording device.

A voice command may be received from the user specifying a file to whichthe content should record at step 1140. Such a voice command may be, forexample, “Linda's file,” “Mom's file,” “Joan's file,” or any othersuitable command corresponding to a file. In another suitable approach,a specific file to record content from a second media source may bechosen based on voice recognition. For example, a user may give a voicecommand to record. A voice recognition system, such as voice recognitionsystem 435 of FIG. 4, may determine the user's identity based on theuser's voice, based on the microphone that received the voice command,or based on any other suitable approach. The media source content mayrecord to a file associated with the identity of the user. The contentmay be recorded to the user-specified device and to the user-specifiedfile at step 1150. The above example is merely illustrative, and anysuitable steps may be employed in order to record media source content,if desired. For example, it may be unnecessary to receive identificationof user-specified devices and/or files. Voice commands may be receivedfrom a user to specify recording parameters such as start time, endtime, recording quality, or any other suitable recording parameter.

Some embodiments may play back previously-recorded media source contentin response to received voice commands. Playing back may be performedfrom any suitable media source. For example, a user may be provided withcontent from a media source. Upon receiving a voice command indicatingthat the user wishes to play back previously-recorded media sourcecontent, such as “play back,” previously-recorded media source contentmay be played back from a desired recording device. A default recordingdevice (e.g., DVD recorder, PVR, VCR, CD recorder, or any other suitablerecording device) may be specified by the user at some time prior toplaying back such that, upon receiving the command “play back,” thepreviously-recorded media source content is automatically played backfrom the default device. The previously-recorded media source contentmay not be content recorded by the user. For example, the content may bevideo-on-demand.

In another suitable approach, the device from which to play back thepreviously-recorded media source content may be specified by receivingan additional command from the user, such as “PVR,” “VCR,” “DVDrecorder,” “CD recorder,” or a command corresponding to any othersuitable recording device. In yet another suitable approach, multipleusers may be provided for such that a file from which to play back themedia source content may be specified by the user. For example, anaudible question may be provided to the user, such as, “What file?” Tothis question, a user name such as “Dad” may be received from the user.Thus, the source content may be played back from Dad's file. In yetanother suitable approach, a specific file from which to play back mediasource content may be chosen based on voice recognition. For example, avoice command to play back may be received from a user. A voicerecognition system, such as voice recognition system 435 of FIG. 4, maydetermine the user's identity based on the user's voice, based on themicrophone that received the voice command, or based on any othersuitable approach. The source content may be played back from a fileassociated with the identity of the user.

Previously-recorded media source content may be played back in responseto receiving voice commands from a user. A flowchart of illustrativesteps involved in playing back previously-recorded media source contentis shown in FIG. 11. Content from a media source (e.g., televisionchannels, audio channels, video-on-demand content, recorded content(e.g., content recorded to a PVR, VCR, CD recorder, DVD recorder, or anyother suitable recording device), Internet-delivered content (e.g.,websites, video, or any other suitable Internet-delivered content), orany other suitable media source) may be provided to a user at step 1210.After the Content from the media source is provided to the user, a voicecommand may be received from the user indicating that the user wishes toplay back previously-recorded media source content. Such a voice commandmay be, for example, “play back.”

A voice command may be received from the user specifying a device fromwhich the previously-recorded media source, content should play back atstep 1230. Such a voice command may be, for example, “DVD recorder,” “CDrecorder,” “VCR,” “PVR,” or any other suitable command corresponding toa recording device. A voice command may be received from the userspecifying a file from which the previously-recorded media sourcecontent should play back at step 1240. Such a voice command may be, forexample, “Linda's file,” “Mom's file,” “Joan's file,” or any othersuitable command corresponding to a file. In another suitable approach,a specific file from which to play back media source content may bechosen based on voice recognition. For example, a voice command to playback may be received from a user. A voice recognition system, such asvoice recognition system 435 of FIG. 4, may determine the identity ofthe user based on the user's voice, based on the microphone thatreceived the voice command, or based on any other suitable approach. Themedia source content may be played back from a file associated with theidentity of the user. The previously-recorded source content may beplayed back from the user-specified device and from the user-specifiedfile at step 1250. The above example is merely illustrative, and anysuitable steps may be employed in order to play back previously-recordedmedia source content, if desired. For example, it may be unnecessary toreceive identification of user-specified devices and/or files. Voicecommands from a user may be received corresponding to a specific programname. In response to the received voice command, the previously-recordedprogram may be played back. Voice commands from a user may be receivedwhile a program is playing back, such as, for example, “stop,” “pause,”“play,” fast forward,” “rewind,” or any other suitable command.

Some embodiments may receive selections of features for an audiblefavorites list from a user and may provide the features of the favoriteslist to the user in an audible scan. For example, a user may be providedwith a menu from which features may be selected for inclusion in anaudible favorites list. Such a menu may include, for example, non-mediaoptions (e.g., sports, weather, news, stocks, traffic, or any othersuitable non-media option), media-related options (e.g., channels,programs, settings, reminders, advertisements, or any other suitablemedia-related option), or any other features suitable foe inclusion inthe audible favorites list. The interactive media guidance applicationmay provide information related to the user-selected features in anaudible scan. This information may be provided to the userautomatically. In another suitable approach, this information may beprovided to the user in response to a user voice command requesting tolisten to the audible, favorites. The audible information may beprovided using audio clips or files associated with the favoritefeatures, streamed audio, audio generated from text, or using any othersuitable approach. The user may interact with the information providedin the audible scan.

Selections of features for an audible favorites list may be receivedfrom a user, and information related to the features may be provided tothe user in an audible scan. As shown in FIG. 12, a display 700 may beprovided to the user. Display 700 may include, for example, a menu 780of selectable program guide features, one or more selectableadvertisements 710, a cable operator or sponsor brand graphic 770, thecurrent time 790, a mail-available indicator 760, a web browser link750, and a media-in-guide window 720. One illustrative program guidefeature is an Audible Favorites option 730. A selection of option 730from the user may be received. The user's selection of option 730 may bemade by using suitable key on a remote control, by giving a suitablevoice command, or by any other suitable approach. In another suitableapproach, a selection of Favorites option 731 may be received. Option731 may allow creation of an audible favorites lists (not shown).

FIG. 13 shows an illustrative audible favorites set-up display 1400 thatmay be provided in response to user-selected option 730. Display 1400may contain, for example, a menu 755 of selectable favorites features,one or more selectable advertisements 710, a cable operator or sponsorbrand graphic 770, the current time 790, a mail-available indicator 760,and a media-in-guide window 720. Selection of elements from menu 755 tobe included in an audible favorites list may be received. Menu 755 maycontain non-media options for inclusion in the favorites list. Possiblenon-media options may include, for example, sports, weather, news,stocks, traffic, or any other suitable non-media option. Menu 755 mayalso contain media-related options, such as, for example, channels,programs, settings, reminders, advertisements, or any other suitablemedia-related option. For example, an audible scan of favorite channelsmay include a description of programming currently airing on a user'sfavorite channels. Up and down arrows 715 may be utilized to scrollthrough menu 755. A user may, utilize up and down arrows 715 by using upand down keys on a remote control, by using voice commands such as “up”and “down,” or by using any other suitable approach. As shown, options735 and 745, sports and weather, respectively, have been selected by theuser from menu 755 for inclusion in the favorites list. To enter theselections, button 725 may be selected by the user. In another suitableapproach, a favorites key on a remote control may be selected by theuser to enter the selections. In yet another suitable approach, a voicecommand from the user may be received to enter the selections.

The features of the favorites list may be provided to the user, inaudible form in an audible scan mode. For example, for thefavorites-list described above, audible weather and sports informationmay be provided to the user. This information may be provided to theuser automatically. In another suitable approach, this information maybe provided to the user in response to a user voice command. Such acommand may be, for example, “favorites.” The audible information may beprovided using audio clips or files associated with the favoritesfeatures, streamed audio, audio generated from text, or using any othersuitable approach. Multiple favorites lists corresponding to multipleusers may be available. A favorites list corresponding to a specificuser may be provided based on voice recognition. For example, a voicecommand may be received from a user to hear a favorites list. A voicerecognition system, such as voice recognition system 435 of FIG. 4, maydetermine the user's identity based on the user's voice, based on themicrophone that received the voice command, or based on any othersuitable approach. The favorites list associated with the identity ofthe user may be provided.

A flowchart of illustrative steps involved in receiving selections offeatures for an audible favorites list from a user and providing thefeatures of the favorites list to the user in an audible scan is shownin FIG. 14. An indication of features from a user for inclusion in afavorites list is received at step 1310. An illustrative example of thisstep is shown in FIG. 13. After receiving the indication of featuresfrom the user, an indication to initiate an audible scan of informationrelated to the features of the favorites list may be received at step1320. Such an indication may be received from the user (e.g., byreceiving a voice command such as “favorites”), from a timer which isprogrammed with a specific interval suitable for providing the passivescan, or from any other suitable indicator. The audible scan of thefeatures may be provided to the user (e.g., by using audio clips orfiles associated with the features, streamed audio, audio generated fromtext, or any other suitable way to provide audible information) at step1330. An indication may be received from the use to access a specificfeature of the favorites list favorite channels, favorite programs,reminders, or any other suitable feature) at step 1340. Audibleinformation corresponding to the specific feature may be provided to theuser at step 1350. For example, in response to an indication to accessfavorite channels, media corresponding to a favorite channel may beprovided to the user. The above example is merely illustrative, and anysuitable steps may be employed in order to receive selections offeatures for an audible favorites list from a user and provide thefeatures of the favorites list to the user in an audible scan. Forexample, a user may interact with the information provided in theaudible scan. The user may be able to give a voice command such as“stop” (e.g., to stop the audible scan), “watch,” “listen,” “record,” orany other suitable command related to a favorites feature.

Some embodiments may provide features specific to a user based on theuser's identity. For example, a voice command from a user may bereceived by a microphone. A voice recognition system may determine theuser's identity by the sound of the voice giving the voice command, bythe microphone used to give the voice command (e.g., by associating aspecific user with a specific microphone for receiving voice commands),or by any other suitable approach for user identification. The identityof the user may be used to provide features specific to the user. Suchfeatures may include, for example, customized guide interfaces, targetedadvertisements, audience measuring, favorites, or any other suitableuser-specific feature.

A flowchart of illustrative steps involved in determining the identityof a user and providing a feature specific to the user is shown in FIG.15. A voice command from a user may be received at step 1410. Afterreceiving the voice command from the user, the guidance application maydetermine the identity of the user at step 1420. For example, theidentity of a user may be determined by the sound of the voice-givingthe voice command, by the microphone used to give the voice command(e.g., by associating a specific user with a specific microphone forreceiving voice commands), or by any other suitable approach for useridentification. A feature specific to the user may be performed at step1430. For example, features such as customized guide interfaces,targeted advertisements, audience, measuring, favorites, or any othersuitable feature may be provided to the user at step 1430. Informationpreviously-provided by a user (e.g., ratings, genre, favorites, etc.)may be used to provide the user with a customized guide interface. Forexample, explicit information that the user provides either verbally ornon-verbally may be used to create a guide interface appropriate for thecurrent user. The above example is merely illustrative, and any suitablesteps may be employed in order to determine the identity of a user andprovide a feature specific to the user.

Some embodiments may provide a user with targeted advertisements basedon the user's monitored actions. An example of an interactive televisionprogram guide in which targeted advertisements may be presented to auser is described, for example, in Knudson et al. U.S. patentapplication Ser. No. 09/034,939, filed Mar. 4, 1998. Actions of a usermay be monitored to determine user characteristic's. For example,actions such as programs viewed, websites visited, music selected, orany other suitable action, may be monitored. A voice command from a usermay be received by a microphone. A voice recognition system, such asvoice recognition system 435 of FIG. 4, may determine the user'sidentity. The voice recognition system may be able determine the user'sidentity by the sound of the voice giving the voice command, by themicrophone used to give the voice command (e.g., by associating aspecific user with a specific microphone for receiving voice commands),or by any other suitable approach for user identification. The user maybe provided with targeted advertisements based on the user'scharacteristics. For example, a user may consistently watch actionmovies. In response to the user's interest in action movies,advertisements for upcoming action movies may be provided to the user.In one suitable approach, targeted advertisements may be associated witha user for use in the future. For example, the next time a mediaguidance application is utilized by the user and a voice command isreceived, targeted advertisements may be provided to the user inresponse to the user's previous actions.

A flowchart of illustrative steps involved in providing a user withtargeted advertisements based on the user's monitored actions is shownin FIG. 16. Actions of the user may be monitored at step 1510 todetermine user characteristics. For example, actions such as programsviewed, websites visited, music selected, or any other suitable action,may be monitored. Such user characteristics may be used to determineadvertisements appropriate for the user. A voice command from a user maybe received at step 1520. After receiving the voice command from theuser, the guidance application may determine the identity of the user atstep 1530. For example, the identity of a user may be determined by thesound of the voice giving the voice command, by the microphone used togive the voice command (e.g., by associating a specific user with aspecific microphone for receiving voice commands), or by any othersuitable approach for user identification. The guidance application maydetermine which advertisements are appropriate for the user based on theuser's characteristics at step 1540. The user may be provided withtargeted advertisements at step 1550. The above example is merelyillustrative, and any suitable steps may be employed in order to providea user with targeted advertisements.

Some embodiments may determine the identity of a user and measure anaudience based on the user's profile. The user's profile may includedemographic data. For example, a voice command from a user may bereceived by a microphone. A voice recognition system, such as voicerecognition system 435 of FIG. 9, may determine the user's identity. Thevoice recognition system may be able to identify a user by the sound ofthe voice giving the voice command, by the microphone used to give thevoice command (e.g., by associating a specific user with a specificmicrophone for receiving voice commands), or by any other suitableapproach for user identification. The user may be provided with contentfroth a media source, such as a television program. Audience informationfor the television program may be measured by using a user profileassociated with the user. The user profile may include demographic datasuch as, for example, age, gender, nationality, profession, income,location, or any other suitable data. The audience information may bemeasured using this data to create a detailed description of theaudience. The description of the audience may result in determiningfuture marketing opportunities for the television program (e.g.,suitable commercials to be shown during the television program),attracting other viewers with similar demographic data, or any othersuitable result.

A flowchart of illustrative steps involved in determining the identityof a user and measuring an audience based on the user's profile is shownin FIG. 17. At step 1610, a voice command from a user may be received.After receiving the voice command from the user, the guidanceapplication may determine the identity of the user step 1620. Forexample, the identity of a user may be determined by the sound of thevoice giving the voice command, by the microphone used to give the voicecommand (e.g., by associating a specific user with a specific microphonefor receiving voice commands), or by any other suitable approach foruser identification. The user may be provided with content from a mediasource at step 1630. Audience information for the content from the mediasource may be measured at step 1640 based on the identity of the User.The identity of the user may include user profile data such as, forexample, age, gender, nationality, profession, income, location, or anyother suitable data. The audience may be measured using this data tocreate a detailed description of the audience. The above example ismerely illustrative, and any suitable steps may be employed in order tomeasure an audience based on a user's identity.

Some embodiments may determine the identity of a user and may providethe user with favorites features based on the user's identity. Favoritesfeatures may include a user's favorite television channels, websites,audio channels, or any other suitable feature. For example, a voicecommand from a user may be received by a microphone. A voice recognitionsystem, such as voice recognition system 435 of FIG. 4, may determinethe user's identity. The voice recognition system may be able toidentify a user by the sound of the voice giving the voice command, bythe microphone used to give the voice command (e.g., by associating aspecific user with a specific microphone for receiving voice commands),or by any other suitable approach for user identification. The user maybe provided with favorites features based on the user's identity.

A flowchart of illustrative steps involved in providing a user withfavorites features based on the user's identity is shown in FIG. 18. Atstep 1710, a voice command from a user may be received. After receivingthe voice command from the user, the guidance application may determinethe identity of the user at step 1720. For example, the identity of auser may be determined by the sound of the voice giving the voicecommand, by the microphone used to give the voice command (e.g., byassociating a specific user with a specific microphone for receivingvoice commands), or by any other suitable approach for useridentification. The user may be provided with favorites features basedon the user's identity at step 1730. For example, the user's favoritetelevision channels, favorite websites, favorite audio channels, or anyother suitable favorites feature may be provided. The above example ismerely illustrative, and any suitable steps may be employed in order toprovide a user with favorites features based on the user's identity.

Thus, systems and methods for providing a user interface with speechrecognition to allow users to navigate within interactive media guidanceapplications are provided. It will be understood that the foregoing isonly illustrative of the principles of the invention and that variousmodifications can be made by those skilled in the art without departingfrom the scope and spirit of the invention, which is limited only by theclaims that follow.

1.-153. (canceled)
 154. A system for providing customized features basedon voice controls, comprising processing circuitry configured to:receive a voice command from a user at a first device; identify the userbased on the received voice command; and generate a presentation of afeature that is customized for the identified user.
 155. The system ofclaim 154, wherein the feature comprises a favorite media source ormedia content of the identified user.
 156. The system of claim 154,wherein the feature comprises a favorites list that includes mediacontent or media sources that are preferred by the identified user. 157.The system of claim 154, wherein the feature comprises a targetedadvertisement selected based on the identified user.
 158. The system ofclaim 154, wherein the processing circuitry is further configured to:receive a request to play media content, wherein the voice commandcomprises the request; and in response to receiving the request: selecta media content based on an association between the identified user andthe media content; and generate a presentation of the media content tothe identified user.
 159. The system of claim 154, wherein the firstdevice comprises a display and speaker configured to present the featureto the identified user.
 160. The system of claim 159, wherein: the voicecommand comprises a request for information identifying media contentthat is available on the first device, and the first device isconfigured to present the information in response to the request. 161.The system of claim 160, wherein the request is executed by a seconddevice.
 162. The system of claim 154, wherein: the first devicecomprises a microphone that receives the voice command, and theprocessing circuitry associates the user with the microphone.
 163. Thesystem of claim 154, wherein the processing circuitry identifies theuser based on a sound of a voice of the user in the voice command. 164.A method for providing customized features based on voice controls,comprising: receiving a voice command from a user at a first device;identifying the user based on the received voice command; and generatinga presentation of a feature that is customized for the identified user.165. The method of claim 164, wherein the feature comprises a favoritemedia source or media content of the identified user.
 166. The method ofclaim 164, wherein the feature comprises a favorites list that includesmedia content or media sources that are preferred by the identifieduser.
 167. The method of claim 164, wherein the media content comprisesa targeted advertisement selected based on the identified user.
 168. Themethod of claim 164, further comprising: receiving a request to playmedia content, wherein the voice command comprises the request; and inresponse to receiving the request: selecting a media content based on anassociation between the identified user and the media content; andgenerating a presentation of the media content to the identified user.169. The method of claim 164, wherein the first device comprises adisplay and speaker configured to present the media content to theidentified user.
 170. The method of claim 169, wherein: the voicecommand comprises a request for information identifying media contentthat is available on the first device, and the first device isconfigured to present the information in response to the request. 171.The method of claim 164, further comprising executing the media controlrequested by the user on a second device.
 172. The method of claim 164,further comprising: receiving the voice command on a microphone; andassociating the user with the microphone.
 173. The method of claim 164,wherein the identifying the user comprises identifying the user based ona sound of a voice of the user in the voice command.